منابع مشابه
Corpus-based Name Standardization
Variation in the spelling of names has various origins, many of which many are difficult to describe by rule. We present a method that uses both rules and a similarity measure of a probabilistic nature, and which can make use of existing onomastic corpora. Rules first convert an unknown name to a semiphonemic form. Then a selection is made of possible candidates in the onomastic corpus. For thi...
متن کاملEvaluation of recent speech grammar standardization efforts
The “Voice Browser “ activity within the W3C consortium addresses the need for standards for speech grammars, dialogue descriptions etc. in distributed systems. This paper discusses the consortium’s recent speech grammar working draft specification. The W3C specification is based on the Java Speech Grammar Format (JSGF) defined by Sun Microsystems and is with all good and bad qualities characte...
متن کاملEstonian Emotional Speech Corpus
The Estonian Emotional Speech Corpus serves as the acoustic basis for emotional text-to-speech synthesis. Because the Estonian synthesizer is a TTSsynthesizer, we started off by focusing on read texts and the emotions contained in them. The corpus is built on a theoretical model and we are currently at the stage of verifying the components of the model. In the present article we give an overvie...
متن کاملSpontaneous Speech Corpus of Japanese
Design issues of a spontaneous speech corpus is described. The corpus under compilation will contain 800-1000 hour spontaneously uttered Common Japanese speech and the morphologically annotated transcriptions. Also, segmental and intonation labeling will be provided for a subset of the corpus. The primary application domain of the corpus is speech recognition of spontaneous speech, but we plan ...
متن کاملCorpus Child Directed Speech Adult Directed Speech
Cross-linguistic studies on unsupervised word segmentation have consistently shown that English is easier to segment than other languages. In this paper, we propose an explanation of this finding based on the notion of segmentation ambiguity. We show that English has a very low segmentation ambiguity compared to Japanese and that this difference correlates with the segmentation performance in a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data Science Journal
سال: 2007
ISSN: 1683-1470
DOI: 10.2481/dsj.6.s806